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ABSTRACT 


Two  mininum  cardinality  set  covering  problems  of  similar  structure 
are  presented  as  difficult  test  problems  for  evaluating  the  computational 
efficiency  of  integer  programming  and  set  covering  algorithms.  The  smaller 
problem  has  117  constraints  and  27  variables  and  the  larger  one,  constructed 
by  H.  J.  Ryser,  has  330  constraints  and  45  variables.  The  constraint 
matrices  of  the  two  set  covering  problems  are  incidence  matrices  of  Steiner 
triple  systems.  An  optimal  solution  to  the  problem  that  we  were  able  to 
solve  the  smaller  one)  gives  some  new  information  on  the  1 -widths  of 
meBA)crs  of  this  class  of  (0,1) -matrices. 


This  research  was  sponsored.  In  pait,  by  National  Science  Foundation  Grants 
GK-32Z82X  and  GP-32316X  and  Office  of  Naval  Research  Gontract  No. 
N00014 -67 -A-0077  -0028  to  Cornell  University  and  the  United  States  Army 
under  Contract  No.  DA-31-124 -ARO-D-462. 


Two  Computational!/  Difficult  Set  Covering  Problems  That  Arise  in  Computing 
the  1-Width  of  Incidence  Matrices  of  Steiner  Triple  S/stems 

D.  R.  Fulkerson,  G.  L.  Nemhauser  and  L,  E.  Trotter,  Jr. 

1.  Introduction.  The  purpose  of  this  note  is  two-fold.  First,  we  supply 

data  for  two  integer  progranming  (set  covering)  problems  which  we  believe 
are  con^utationally  hard.  Our  experience  indicates  that  optimal  solutions 
to  these  problems  are  vety  tedious  to  conqpute  and  verify,  even  though  they 
have  rar  fewer  variables  than  numerous  solved  problems  in  the  literature. 

Thus  these  problems  seem  to  be  appropriate  difficult  test  problems  for 
evaluating  the  computational  efficiency  of  integer  programming  and  set 
covering  algorithms. 

The  constraint  matrices  of  the  two  set  covering  problems  are  incidence 
matrices  of  Steiner  triple  systems.  An  optimal  solution  to  the  problem 
that  we  were  able  to  solve  (the  smaller  one)  gives  some  new  information 
on  the  1-widths  of  members  of  this  particular  class  of  (0,1) -matrices. 

This  new  information  is  the  second  purpose  of  this  note. 

2_j _ Origin  of  the  Problems.  The  a-width  of  a  (0,1) -matrix  A  is  the 

minimum  number  of  columns  that  can  be  selected  from  A  so  that  all  row 
sums  of  the  resulting  subraatrix  of  A  are  at  least  a.  Here  o  is  an 
integer  parameter  ranging  from  zero  to  the  sraallect  row  sum  of  the  matrix 
A.  Tnis  notion  was  introduced  and  studied  by  Fulkerson  and  Ryser  in  a 
scries  of  papers  [3,  4,  5].  Henceforth  we  restrict  attention  to  the  case 
0=1  and  denote  the  1 -width  of  an  m  by  n  (0,1) -matrix  A  having 
no  zero  rows  by  w  (A) .  The  integer  w  (A)  can  thus  be  determined  from  an 
optimal  solution  to  the  set  covering  problem 

(2.1)  w(A)  »  min  l^x, 

— "  ID 

X  ^  0  and  integral . 
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University. 


Here  is  the  k>vector  all  of  whose  components  are  1,  and  we  are  viewing 

A  as  the  incidence  matrix  of  m  elements  (rows)  vs.  n  subsets  (columns) 
of  the  m>set.  (Alternatively,  and  this  point  of  view  is  perhaps  more 
appropriate  for  the  discussion  to  follow,  we  can  view  (2.1)  as  the  problem 
of  determining  the  least  number  of  elements  of  an  n-set  required  to  represent 
all  members  of  a  family  of  m  subsets  of  the  n>set.) 

A  Steiner  triple  system  on  n  elements  is  a  pair  (S,T)  where 
S  •  {1,2,. ..  ,n}  and  T  ■  (S,  ,...,S_)  is  a  family  of  triples  (subsets  of 
S  of  cardinality  three)  such  that  every  pair  of  elements  of  S  (every 
subset  of  S  of  cardinality  two)  is  contained  in  precisely  one  of  the 
triples.  It  is  well  known  that  Steiner  triple  systems  exist  if  and  only 
if  n  3  and  n  =  1,  3  (mod  6),  in  tdiich  case  m  =  ,  and  each 

element  of  S  appears  in  precisely  ^  ~  ^  of  the  triples.  (Steiner  triple 
systems  are  particular  cases  of  coiii}inatorial  configurations  known  as 
balanced  inconplete  block  designs,  which  are  used  in  the  statistical  design 
of  experiments.  For  exai^le,  if  n  drugs  are  to  be  tested  on  m  patients, 
and  each  patient  is  to  be  given  three  drugs,  a  Steiner  triple  system  pro¬ 
vides  a  design  in  which  each  pair  of  drugs  is  tested  on  one  patient.) 

The  incidence  matrix  A  •  (a^^)  of  a  Steiner  triple  system  is  a 
(0,1) -matrix  whose  rows  correspond  to  the  triples  and  whose  columns  corre¬ 
spond  to  the  elements  of  S.  Thus  a^^  «  1  if  and  only  if  j  c  S^. 
Corresponding  to  each  of  the  parameters  n  «  3,  7,  9  there  is  a  unique 
Steiner  triple  system;  for  n  ■  13,  there  are  two  distinct  systems,  and 
for  n  ■  15,  there  are  eighty  distinct  systems  [14].  The  unique  system 
for  n  *  9  is  given  by  the  12  by  9  incidence  matrix 
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where  0  is  the  zero  matrix  of  order  3,  I  is  the  identity  matrix  of 
order  3,  and 


ro  1  in 


z 


1  0  1 


Li  1  oj 


For  the  system  (2.2),  we  have  w(Ag)  «  5,  since  the  rows  of  are 
covered  by  its  first  five  columns,  but  by  no  smaller  set  of  columns.  Both 
systems  corresponding  to  n  ■  13  have  1-width  7;  for  n  «  15,  the  1-width 
varies  from  7  to  9  [5].  Indeed,  in  their  study  of  1-widths  of  Steiner 
triple  systems,  Fulkerson  and  Ryser  [5]  derived  the  lower  boiind 

(2.3)  w(A)>."-^ 

and  determined  conditions  under  which  equality  holds  in  (2.3).  They  remarked, 

however,  that  good  iq)per  bounds  seem  difficult  to  obtain.  Since  all  triple 

2 

systems  on  15  or  fewer  elements  have  1 -widths  at  most  jn  -  1,  they  at 
one  time  speculated  that  this  might  be  an  upper  bound.  Ryser  siibsequently 
constructed  a  45  element  system  as  a  potential  counter-exaiq)le,  and  con¬ 
jectured  that  it  had  1-width  30.  Ryser' s  construction  of  the  45-element 
system  starts  with  a  particular  15-element  system  that  has  1 -width  9.  This 
15 -element  system  has  the  35  by  15  incidence  matrix 

E  0 
Z  E 
0  Z 
I  I 


(2.4) 
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where 


0  0  110 
0  0  0  1  1 
1  0  0  0  1 
110  0  0 
0  110  0 
0  10  0  1 
10  10  0 
0  10  10 
0  0  10  1 
10  0  10 


1  0  0  0  0 
0  10  0  0 
0  0  10  0 
0  0  0  1  0 
0  0  0  0  1 
1  0  0  0  0 
0  10  0  0 
0  0  10  0 
0  0  0  1  0 
0  0  0  0  1 


and  I  is  the  S  by  5  identity.  He  then  constructs  a  4S-element  system 
having  a  330  by  45  incidence  matrix  of  the  following  form: 


(2.5) 


In  (2.5)  I  is  the  15  by  15  identity  matrix;  for  k  «  1,...,15  is 

a  IS  by  15  matrix  whose  column  is  all  ones  and  whose  other  columns 

]( 

are  all  zeros,  and  P  is  a  15  by  15  permutation  matrix  with 


» 


where  J  is  the  matrix  of  all  ones. 
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We  attempted  to  determine  by  solving  the  set  covering  problem 

(2.1),  but  did  not  succeed.  We  then  used  the  matrix  from  (2.2)  to 
construct  a  27-element  system  structurally  similar  to  The  resulting 

117  by  27  Incidence  matrix  is 


(2.6) 


Ag  0  0 

0  Ag  0 

0  0  Ag 

I 


2 

We  solved  (2.1)  for  and  found  that  J”  "  ^ 

not  an  upper  bound  on  1-widths  of  Steiner  triple  systems  on  n  elements. 
The  system  (2.5)  constructed  by  Ryser  has  ^(A^j)  ^  30,  and  we  feel 
reasonably  sure  that  Ryser's  speculation  that  wCA^g)  *  30  is  correct. 

More  detailed  descriptions  of  A^^  and  A^^  are  given  in  the 
Appendix. 


3.  Computational  Experience.  An  optimal  linear  programming  solution  in 

both  problems  is,  of  course,  to  set  all  variables  equal  to  j  ,  giving 

values  of  9  and  15,  respectively,  for  the  sum  of  variables  in  the  two 

problems.  Optimal  linear  programming  bases  are  not  unique  for  A^^  and 

A^^;  the  computer  produced  optimal  bases  with  determinants  of  magnitude 
3  3 

(27)  and  (45)  ,  respectively.  An  optimal  solution  to  (2.1)  with  A  >  A^^ 
is  given  by  columns  1,  2,  3,  4,  5,  6,  10,  11,  12,  13,  14,  IS,  19,  20,  21,  25, 


26,  27. 
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Both  problems  were  atten^ted  by  an  implicit  enumeration  algorithm  [17] 
and  a  cutting  plane  algorithm^ based  on  Gomory's  method  of  integer  forms  [7]. 

All  computing  was  done  on  a  Univac  1108.  We  also  sent  to  several 

people  who  have  obtained  coaputational  experience  with  integer  programming 
codes  different  from  those  available  to  us.  Of  particular  interest  would 
be  results  obtained  from  codes  based  on  the  group-theoretic  approach  to 
integer  programming  [2,  8,  9,  10]^  and  codes  based  on  specialized  algo¬ 
rithms  for  set  covering  problems  [1,  11,  13,  16].  Nobody  has  informed  us  that 
they  have  solved  (2.1)  for  A^^.  There  were  a  few  negative  responses. 

These  generally  did  not  report  unequivocal  failure,  but  difficulty  in 
treating  a  problem  of  this  size  with  an  experimental  code,  etc. 

Using  the  cutting  plane  code,  we  solved  (2.1)  for  A  &  A^  in  about 
20  seconds  after  adding  44  cuts.  The  cutting  plane  code  was  unsuccessful 
on  the  larger  triple  systems  A^^,  A^^,  and  A^^.  Typically,  the  first 
few  iterations  yielded  cuts  that  caused  an  increase  in  the  objective  value, 
but  this  was  followed  by  loig  sequences  of  cuts  that  failed  to  produce 
a  significant  change  in  the  value  of  the  objective  function.  For  example, 
in  treating  A^^,  the  initial  sequence  of  19  cuts  raised  the  objective 
value  from  5.0  to  5.425,  after  which  32  cuts  left  the  value  unaltered. 

(Recall  that  w(Ajj1  *  9.)  This  particular  run  took  82  seconds.  In  con¬ 
trast,  this  problem  was  solved  in  5  seconds  using  implicit  enumeration. 

This  enumeration  algorithm  is  very  similar  to  one  developed  and  tested  by 
Geoffrion  [6].  It  uses  single  tests  for  infeasibilities,  and  uses  linear 

programming  relaxations  to  obtain  lower  bounds  and  thus  a  necessary  condition 

t'The  particular  cutting  plane  code  used  was  DULPDX,  which  is  available  on  the  MACC 
Program  Library  at  the  University  of  Wisconsin,  Madison. 

^Shapiro  [IS]  attenq;)ted  to  solve  (2.1)  with  A^^  using  IPA  [9].  The 

enumeration  was  abandoned  after  about  1  minute  of  computer  time  on  the  IBM 
360/67.  Optimal  solutions  found  for  the  group  problems  were  highly  infeasible 
and  little  progress  was  being  made  towards  achieving  feasibility  in  the 
enumeration  phase. 
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on  whether  a  given  partial  solution  to  (2.1)  has  a  feasible  completion  of 
smaller  value  than  the  best  available  solution.  The  linear  programming 
solutions  also  generate  surrogate  constraints.  A  dual  algorithm  is  used 
«  to  solve  ths  linear  programs.  When  fathoming  occurs,  the  next  partial 

solution  considered  is  determined  by  backtracking.  Branching  is  done  by 
fixing  a  free  variable  at  zero  or  one. 

In  our  successful  run  with  began  with  the  partial  solution 

corresponding  to  the  optimal  solution,  which  was  found  by  inspection.  About 
6000  partial  solutions  were  considered  before  optimality  was  verified.  The 
run  consumed  about  16  minutes  of  con;)uter  time.  Most  of  the  fathoming 
came  from  the  L.P.  bounds,  but  this  bounding  test  was  quite  ineffective  for 
partial  solutions  having  a  small  number  of  fixed  variables.  Several  atteiqpts 
at  solving  the  problem  for  A^^  failed,  although  we  began  with  what  is 
probably  an  optimal  solution. 

In  contrast  with  our  experience  on  these  problems,  others  have  reported 
considerable  success  in  solving  set  covering  problems  with  a  variety  of 
algorithms  (1,  6,  9,  10,  11,  13,  16].  For  exanqple,  Geoffrion  [6]  has 
reported  solving,  with  an  algorithm  essentially  identical  to  our  implicit 
enumeration  algorithm,  several  set  covering  problems  with  m  ■  30  and 
30  ^  n  ^  90  in  times  varying  from  one  to  twelve  seconds  on  the  IBM  7044, 
a  much  slower  computer.  Lemke  et  al.  [13]  have  solved  considerably  larger 
problems  in  a  reasonable  amount  of  time,  e.g.  a  problem  with  m  «  SO,  n  »  4S0 
was  solved  in  about  2  minutes  on  an  IBM  360/50.  Gorry  et  al.  [9]  have 
solved  some  airline  crew  scheduling  problems;  one  with  m  •  313  and  n  *  482 
'  was  solved  in  about  3  minutes  on  the  IBM  360/85. 

Compared  to  most  of  the  covering  problems  considered  in  the  literature, 
the  problems  and  A^^  have  a  relatively  small  number  of  variables 
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but  a  large  number  of  constraints.  Supposedly,  how’vr,  of  the  two  param¬ 
eters,  the  number  of  variables  is  the  more  significant  in  solving  a  covering  < 

problem  by  implicit  enumeration.  Furthermore,  the  densities  (number  of 
ones/mn)  of  and  are  close  to  the  densities  in  the  problems 

considered  in  [6]  and  [13]  mentioned  above. 

Why,  then,  are  these  two  problems  and  A^^  difficult?  We  don't 

really  know,  but  some  plausible  explanations  might  be: 

1.  The  symmetries  in  the  problems  no  doubt  tend  to  increase  the 
amount  of  enumeration  required. 

2.  The  rather  large  determinants  of  optimal  L.P.  bases  contribute  to 

the  unattractiveness  of  cutting  plane  methods  and  indicate  that  group-theoretic 
methods  may  encounter  difficulties. 

t 

3.  The  optimal  value  in  the  integer  problem  is  large  compared  to  the 

optimal  value  in  the  real  (or  rational)  problem,  thus  emasculating  the  power  * 

of  linear  programming. 

Recently  Jeroslow  [12]  has  constructed  a  simple  family  of  n  variable, 

(O-l)-integcr  programs  that  cannot  be  solved  by  implicit  enumeration,  even 

n  f  2 

using  linear  programming  for  fathoming,  without  enumerating  at  least  2 
possibilities.  Although  k^^  and  A^^^  are  not  in  this  family,  they  show 
that  problems  that  arise  naturally  can  be  nasty.  We  hope  that  a  reasonable 
set  of  such  hard  problems  can  be  accumulated  ior  the  purpose  of  evaluating 
the  efficiency  of  proposed  integer  progranuning  algorithms. 
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